Prediction of Conserved Motif Networks based on Comparative Genomics
نویسندگان
چکیده
Discovery of regulatory motifs in unaligned DNA sequences remains a fundamental problem in computational biology. Most motif finding algorithms perform well in finding over-represented substrings in a small collection of sequences. This requires some prior knowledge about the functional relations among genes in a study in order to select a good collection of promoters. Microarray gene profiling and Chromotin-IP are often used to provide a list of sequences that may be enriched for binding sites of some transcription factors (TFs). However, any limitation in such experiments will propagate to the motif finding step, preventing us from finding motifs for many TFs, as well as from getting a global view of motif network in the whole genome. To date, over 200 yeast TFs have been assayed, but less than 100 have confident motif predictions.
منابع مشابه
Eukaryotic regulatory element conservation analysis and identification using comparative genomics.
Comparative genomics is a promising approach to the challenging problem of eukaryotic regulatory element identification, because functional noncoding sequences may be conserved across species from evolutionary constraints. We systematically analyzed known human and Saccharomyces cerevisiae regulatory elements and discovered that human regulatory elements are more conserved between human and mou...
متن کاملComparative genomics of human stem cell factor (SCF)
Stem cell factor (SCF) is a critical protein with key roles in the cell such as hematopoiesis, gametogenesis and melanogenesis. In the present study a comparative analysis on nucleotide sequences of SCF was performed in Humanoids using bioinformatics tools including NCBI-BLAST, MEGA6, and JBrowse. Our analysis of nucleotide sequences to find closely evolved organisms with high similarity by NCB...
متن کاملPredicting regulons and their cis-regulatory motifs by comparative genomics.
We have combined and compared three techniques for predicting functional interactions based on comparative genomics (methods based on conserved operons, protein fusions and correlated evolution) and optimized these methods to predict coregulated sets of genes in 24 complete genomes, including Saccharomyces cerevisiae, Caenorhabditis elegans and 22 prokaryotes. The method based on conserved oper...
متن کاملSALAD database: a motif-based database of protein annotations for plant comparative genomics
Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database (http://salad.dna.affrc.go.jp/salad/) from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME softw...
متن کاملPredictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks.
Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning f...
متن کامل